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Certain radars, sonars, and other sampling instruments periodically 
measure a variable but can measure accurately only if prediction equations 
provide the instrument with an accurate prediction of the next value of this 
variable. For these instruments, it is appropriate to define reliability as the 
probability that the error in this prediction does not exceed some limit. In 
choosing the form and parameters of the prediction equations, it is reasonable 
to attempt to maximize this reliability. 

Assumptions that the prediction equations utilize linear error measure- 
ments, are recursive, and provide least-squares smoothing with an exponential 
weighting function establish a realistic basis for calculating the reliability. 
The first through the third orders of these equations predict the variable as 
reliably as possible in the presence of large initial errors in the variable and 
its velocity, provided that the smoothing interval of the prediction equations 
is sufficiently short. The dynamic error component of the prediction error 
of these equations is proportional to a smoothed version of the qth time- 
derivative of the variable, where q is the order of the prediction equations. 
The assumption that the measurement errors are uncorrelated and stationary 
makes it possible to calculate the standard deviation of the random com- 
ponent of the prediction error. 

On the assumption that the random component of the prediction error has 
a normal (or Gaussian) probability distribution, there exists a safety factor 
which is monotonically related to the reliability. The choice of the smoothing 
interval of the prediction equations which maximizes this safety factor can 
be found, which in turn permits the optimum safety factor to be calculated. 
The ratio of pairs of these optimum safety factors determines which order 

* This paper reflects a study supported by the Army's Nike-X Project Office, 
Redstone Arsenal, Alabama. 
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of prediction equations gives the greatest reliability in the "worst case" 
situation in which the first unestimated time-derivative of the variable as- 
sumes its largest possible value. Graphs containing the foregoing results 
make it convenient to examine the tradeoffs between the reliability, the time 
between measurements, and other parameters. A numerical example is 
given. 
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1. INTRODUCTION 

Certain instruments attempt to determine the value of a variable by 
measuring the difference between it and a prediction of it at regular 
intervals of time. Examples of these instruments include echo-ranging 
radars and sonars. These measurements enter a computer which im- 
mediately substitutes the measurements into equations which predict 
the next value of the variable, thereby closing the loop illustrated in 
Fig. 1. This prediction must be supplied to the instrument because it is 
assumed that the instrument can measure sufficiently accurately only if 
it knows approximately where to look for the next value of the variable. 
Conversely, large errors in this prediction are assumed to blind or other- 
wise confuse the instrument. 

More precisely, it is assumed that positive limits L and ll on the 
prediction error E are given, where E is defined as the difference between 
the prediction x of the variable and its true value x. It is assumed that 
if E stays within the interval —L ^ E ^j L, the instrument almost 
never grossly mismeasures the variable. However, even when E stays 
within this interval, it is assumed that the instrument slightly mis- 
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Fig. 1 — Tracking servomechanism. 

measures the variable by contributing an additive random error e to 
the measurements E' of the prediction error. The quantities L, L , E, 
x, x, e, and E' are illustrated in Fig. 2. Because E' enters the prediction 
equations, x contains random error caused by e. These random errors 
help make it possible for E to exceed L or to become less than — L . An 
example of the situation described in this paragraph is given in Ref. 1. 

Reliability is defined herein as the probability that E stays within 
the interval —L'^E^L. Designers usually seem to feel that there is a 
need to maximize reliability defined in this manner. It must be admitted 
that different designers always seem to choose slightly different values 
of L and l! , but it almost always turns out that the results in this paper 
are not affected significantly by slight differences in these values. 

After selecting a relatively simple class of prediction equations which 
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assume that the variable is represented in a Taylor's series, this paper 
attempts to explain how to choose the order and the smoothing interval 
of these prediction equations. The criterion used in choosing these 
parameters is that of maximizing reliability at the instant when the 
variable x is exhibiting that behavior (termed "worst case") which tends 
to minimize reliability. Using the same class of prediction equations (in 
a form less convenient for computation than the equations of this paper) 
and some of the assumptions employed herein, R. G. Brown 3 proposed 
the same objective as that of this paper, but the ill-defined character of 
Brown's time-series (e.g., the history of an inventory) prevented him 
from explicitly determining an optimum order and smoothing interval. 
Reliability can be maximized by maximizing a quantity which is 
monotonically related to it. As discussed at the end of this section, it is 
assumed that the probability that E < —L can be neglected in com- 
parison with the probability that E > L. Assuming also that E is nor- 
mally distributed [Chapter 7 of Ref. 7], the reliability is a monotonically 
increasing function of the safety factor 

X = ^^ (1) 

where W is denned as the mean of E, so that W = E. Also, R is defined 
as R s E — W (so that R = 0), and o R denotes the standard deviation 
ofR. 

It is arbitrarily assumed that the prediction equations are linear dif- 
ference equations and that the mean value e„ of each of the measurement 
errors is zero. In consequence, the following simplifications are evident: 
(i) W, called the dynamic error, is caused solely by the inability of the 
prediction equations to predict more complicated time-histories of the 
variable than the equations are designed to predict, and (it) R, called 
the random error component of the prediction error, is caused solely by 
the measurement error. In consequence of the first simplification and of 
the fact that X in (1) decreases as the dynamic error W increases, the 
"worst case" behavior of the variable x is that behavior which maxi- 
mizes W. This maximum value of W is hereafter referred to as reaching 
its "worst case" value W c . 

The "worst case" value W c can be calculated with the aid of a theorem 
(given later) which implies that W is essentially proportional to the 
qth time-derivative of the variable; q = 1, 2, or 3 is the order of the pre- 
diction equations used in this paper. Consequently, it is necessary to 
know the "worst case" bounds (i.e., the greatest lower bound or the 
least upper bound) on the first, second, or third time-derivatives (here- 
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after referred to, respectively, as the velocity v, acceleration a, or jerk j) 
of the variable x. These "worst case" bounds often can be deduced from 
constraints imposed by physics or physiology. 

It is assumed that the random error component R has a normal (or 
Gaussian) probability distribution. This assumption is always satisfied 
if the measurement errors have normal probability distributions. 
[However, if the measurement errors are not normally distributed, the 
averaging effect of the prediction equations often makes the random error 
component distribution approximately normal (as suggested by the 
Central Limit Theorem)]. 7 Each measurement error e„ is assumed to 
occur in a random manner independently of every other measurement 
error, and all e n are assumed to have equal standard deviations. This 
standard deviation (or equivalently this rms value) need not be known 
prior to selecting the parameters of the prediction equations, but it must 
be known prior to calculating the reliability expected from the selected 
prediction equations. 

Just prior to (1), it was assumed that the probability that E < —L 
is negligibly small compared to the probability that E > L. This assump- 
tion is equivalent to the statement that, measured in units of size equal 
to o R , W c is much closer to L than to —L'. Alternatively, if W c is 
much closer to —h' than to L, the right side of (1) is replaced by 
(1/ + W c )/<tr ; this replacement does not change the arguments of this 
paper significantly. In the seemingly uncommon situation in which, 
after applying the results of Sections VI and VII, it turns out that W e 
is not much closer to one of the limits than it is to the other, the 
methods of this paper do not maximize the reliability even though X is 
maximum. The methods used in Section VI also assume that W c has the 
same sign as the limit to which it is much closer (i.e., W c positive if L is 
much closer, or W c negative if — l! is much closer); this assumption 
seems very likely to be satisfied. 

II. DETERMINING THE FORM OF THE PREDICTION EQUATIONS (PRIOR TO 
MAXIMIZING RELIABILITY) 

It is pertinent to list some reasonable objectives which the prediction 
equations should attain. 

To make it possible to achieve a level of reliability which is satisfac- 
tory at all times, provided that the reliability is large enough at some 
time, it is sufficient if W e and a R (and therefore also X) do not vary as 
time passes. If the standard deviation of the measurement errors and 
the behavior of the parameter do not vary with time, this constancy of 
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W c and ait can be achieved by using prediction equations whose form 
and parameters do not change as time passes. Such prediction equations 
effectively smooth over a constant interval of preceding measurements. 
(Prediction equations of this sort may exhibit an initial period, whose 
duration equals the smoothing interval, during which W and a R vary. 
In practice, reliability usually is not too small during this initial period, 
because the "worst case" behavior of the variable usually does not occur 
at this time.) 

The tradeoffs involved in maximizing the reliability of the prediction 
equations should be as obvious as possible. One method of achieving this 
is to make the prediction equations contain only a single independent 
parameter, such as the length of the interval over which preceding 
measurements are smoothed. 

The prediction equations should be easy to initialize and should readily 
provide information by which the track can be handed off to other pre- 
diction equations. These objectives suggest that the prediction equations 
should explicitly estimate the time-derivatives of the variable. 

The prediction equations should occupy as little storage space in the 
computer as possible. This objective suggests the use of recursive equa- 
tions, which require storing only the most recent values of all quantities 
appearing in these equations. The need for keeping computation time to 
a minimum implies that the prediction equations should be few and easy 
to calculate. 

It appears that the foregoing objectives can be attained by arbitrarily 
assuming that the variable is represented by a Taylor's series whose 
coefficients satisfy the "exponential smoothing" criterion. This criterion 
makes the expected (or mean) values of the sum of the exponentially 
weighted squares of the measured prediction error as small as possible. 
(A source of confusion may exist because the definition of the term 
"exponential smoothing" in this paper is more specific than another 
apparently obsolete definition of this term, which a few people have 
used to denote any set of recursive equations having constant coefficients.) 
Stating the exponential smoothing assumption algebraically, the quan- 
tity 

E E' n J K* 

[where E denotes the measurements of prediction error, where 



\N + 1, 
is the exponential weighting coefficient (with N ^ 1 being called the 
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smoothing interval), and n denotes the time of the latest prediction] 
represents the sum of the exponentially weighted squares of E . This 
sum is to be minimized by differentiating it separately with respect to 
each of the coefficients of the Taylor's series representing the variable, 
setting the differentials equal to zero, and solving the resulting set of 
equations simultaneously to obtain the optimum values of the Taylor's 
series coefficients. These steps are explained by Levine. 

Recursive prediction equations which satisfy this requirement can be 
obtained from Levine 2 by setting his weighting coefficient o>,- equal to 
K~\ taking the limit as n approaches infinity, and substituting the re- 
sults into his equations (15) through (17) or else (54) and (56). Similar 
prediction equations satisfying the exponential smoothing criterion 
appear in Refs. 3 or 11. Using Levine 's notation, the prediction equations 
for first, second, and third-order smoothing (corresponding to a Taylor's 
series containing one, two, or three coefficients) can be rearranged to 
minimize the amount of data-storage and computation time required. 
The rearranged prediction equations are given in the next three sub- 
sections. 

2.1 First-Order Prediction Equations 

.f, 1+ i = x n + OCiE n (2) 

E n ' = X n - Xn ( 3 ) 

ax = 1 - K (4) 

K-fc\, **1. («) 

where N is a constant, where x is the predicted value of the parameter, 
and where £ denotes the "observed" value of the variable, which can be 
calculated by adding x to the measured prediction error E . (The sub- 
script 1 in ai is different from Levine's subscript in that now the sub- 
script denotes the order of the smoothing equations instead of denoting 
the time of the measurement being processed by Levine's equations.) 

2.2 Second-Order Prediction Equations 

U n+ 1 = Un + fcEn (fi) 

X n+ i = X n + U n +l + CtiEn (7) 

ft = (1 - Kf (8) 

a 2 = 1 - K 2 (9) 
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Un = V n T, (10) 

where K and E' are defined in (3) and (5), where i) is the predicted 
velocity, and where T is the constant interval of time between the 
measurements of E . Equation (6) is supposed to be computed im- 
mediately before (7) is computed. As a result, u and x can be stored in 
a single word of computer memory each. A similar statement can be 
made about §, u, and x in the next three equations. 

2.3 Third-Order Prediction Equations 

S»+i - 6 n + yzEn (11) 

4.+1 = Un + 2I„ +1 + &#„' (12) 

Xn+1 = X n + tin+l ~ L+l + OC 3 E n ' (13) 

y 3 = Hl-K) 3 (14) 

ft = 1(1 -K 2 )(l -K) (15) 

a 3 = 1 - K 3 (16) 

^n = \&nT\ (17) 

where u, K, and ^ are defined in (10), (5), and (3), and where & is the 
predicted acceleration. 

For convenience, a graph of the coefficients calculated in (4), (8), and 
(9), and (14) through (16) is presented in Fig. 3. Equations (2), (6), 
and (7), or (11) through (13) are assumed to be initialized by using a 
priori estimates of the variable and (if required) its velocity and ac- 
celeration to calculate the initial value of x, u, and § with the aid of (10) 
and (17). Conversely, (2), (6), and (7), or (11) through (13) together 
with (10) and (17) provide estimates of x, v, and d which can be used 
to transfer (or hand off) the track to another set of prediction equations 
by providing initial predictions for this new set. [Of course, d is available 
only from (11) and i) from (12) or (6).] These predictions can be ex- 
tended to any future time merely by using a Taylor's series. For example, 



(!)^ + Q,)V (is) 



Xn+U/T) = X n + I ^ 

Alternatively, if it is desired to predict x, u, and s at any integral multiple 
of T in the future, it suffices merely to calculate (2), (6), and (7), or 
(11) through (13) in appropriate number of times but with the coef- 
ficients a, /3, 7 set equal to zero. 

The single independent parameter N in (5) is identified as the smooth- 
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Fig. 3 — Coefficients of exponential smoothing equations. 

ing interval because (2), (6), and (7), and (11) through (13) have nearly 
the same amounts of dynamic and random error components as poly- 
nomial smoothing equations (Ref. 4) designed to smooth over the last 
N samples. Additionally, K is approximated closely by exp (-2/JV) for 
all N ^ 1.5, thereby making N be approximately the exponential decay 
constant of y/~K (which in Levine's paper 2 is set equal to the reciprocal 
of the standard deviation). 



III. COMPARISON OF THE EXPONENTIAL SMOOTHING CRITERION WITH 
OTHER OPTIMIZATION CRITERIA 

A different set of prediction equations designed specifically for maxi- 
mum reliability in acquiring the track, under the assumptions that the 
standard deviations of the errors in the initial values of x and -0. (supplied 
from some external source of information) are much larger than L and 
that the standard deviation of the measurement error is much smaller 
than L, would predict the second value of x by adding the a prwri esti- 
mate of u to the first value of I: This procedure permits the initial 
velocity to be in error by as much as L/T without E ever exceeding L. 
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Fig. 4 — Maximum permissable initial error in velocity for (6) and (7) and 
(11) through (13) divided by L/T; (b) second order; (c) third order. 

A comparison between the effectiveness of (G) and (7) or (11) through 
(13) and the above prediction equations is shown in Fig. 4, which shows 
the interval of permissable initial velocities of (6) and (7) or (11) 
through (13) divided by L/T. (Simulations reveal that the region of 
permissable initial positions and velocities approximates a horizontal 
rectangle on the prediction error phase plane, whose position error axis 
is intersected by the sides of the rectangle at ±L.) Fig. 4 indicates that 
if N is small enough, (6) and (7) and (11) through (13) perform as 
reliably as the above set of prediction equations because the ratios are 
unity. 

Evaluating the extent to which (6) and (7) satisfy another optimiza- 
tion criterion, Benedict and Bordner 5 state that critically damped equa- 
tions equivalent to (6) and (7) give virtually the same accuracy as the 
set of recursive second-order smoothing equations which minimize a 
weighted sum of the random and the dynamic errors. 



IV. DYNAMIC ERROR COMPONENT CALCULATIONS 

The following theorem, which is illustrated in Fig. 5, provides a basis 
for calculating the dynamic error. 
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Fig. 5 — Illustration of the theorem. 



Theorem: Let q = 1,2, or 3 denote the order of the prediction equations, and 
let the qth difference of the variable x be defined as (z — l) q x, where z is 
the advance operator (which denotes taking the next sample of the time series 
on which it is operating, as in the example (z — l).r„ = x n +i — x»). Then, 
the dynamic error component equals the product of — [(N + l)/2]' and the 
final output of q identical first-order prediction equations having the same 
smoothing interval as the original prediction equations and forming a series 
connection in which the output of one equation is the input to the next 
equation, with the initial input being the qth difference of the variable. A 
proof of the theorem is given in Appendix A. 

A natural and intuitively satisfying interpretation of this theorem is 
obtained from the fact that the qth difference of the variable closely ap- 
piwr'mates the qth derivative of the variable multiplied by T 9 . The q 
cascaded first-order filters, acting like simple low-pass resistance-capaci- 
tance filters, tend to attenuate all rapid fluctuations in the qth derivative 
and to delay the change in the dynamic error caused by a nonzero 
q + lth derivative by q(N + 1 )/2 samples. Thus, to estimate the worst 
case dynamic error of the 9th order prediction equations, it suffices simply 
to multiply the maximum value of the variable's gth derivative (aver- 
aged over a total smoothing interval of qN samples) by —T q [(N +1)/ 
2f, where q = 1,2, or 3. 

To complete this treatment of dynamic error, the dynamic error com- 
ponent of u or s can be defined as u or s minus the first difference or one 
half of the second difference of the variable. [In computing the dynamic 
error of u for the third-order prediction equations, it was necessary to 
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define the true velocity operator as \ (z — 1/z), instead of (z — 1 ).] It is 
possible to use the methods of Appendix A to prove theorems about 
these dynamic errors; these theorems can be interpreted in nearly the 
same way as in the previous paragraph. These results, along with those 
of the previous paragraph, are presented in Fig. 6. The dynamic error 
coefficients are to be multiplied by — vT in the first-order case, — aT 2 in 
the second-order case, and — jT 3 in the third-order case. The delays 
through the various connections of first-order filters (each of which 
introduces a delay of NT/2) are contained in Table I. The dynamic 
error coefficients plotted in Fig. 6 are listed in Table II. 

V. RANDOM ERROR COMPONENT CALCULATIONS 

Fig. 7 shows the coefficients of the standard deviation (or rms value) 
of the random error component of x, u, and §, and Table III lists the 
asymptotic behaviors of these coefficients for large values of N. (Fig. 7 
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Fig. 6 — Dynamic error — first order, (a) x; second order, (b) x, (c) ti; third 
order, (d) x, (e) d, (f) s. 
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Table I — Delays of Dynamic Error for gru Order 
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was obtained by numerically calculating the square root of the sums of 
squares of the quantities estimated by (2), (6), and (7) and (11) 
through (13) in response to x = 1, £,• = 0, i > [as in Ref. 5: (10) and 
Appendix I]. The actual standard deviations can be obtained by multi- 
plying the coefficients shown in Fig. 7 and Table III by the standard 
deviation <r f of the measurement error. 

Fig. 8 gives the correlation coefficients p (defined as the covariance 
divided by the product of the standard deviations of the quantities ap- 
pearing in the covariance calculations), and Table IV lists their asymp- 
totic values as N becomes very large. These correlation coefficients to- 
gether with the standard deviation coefficients can be used to calculate 
the standard deviation of a prediction extended to any future time, be- 
cause taking the expected value of the square of both sides of (18) gives 
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Table II — Dynamic Error Coefficients 





9 = 1 


2 


3 




Di 2 


D (N+ 1)» 


D (N + i) 3 




D * 4 


u, g 




— 


N + 1 


(N + l) 2 


fc 2 I 2 1 




4 


_" N + 1 (AT+1) J _ 


s 


— 


— 




3(JV + 1) 

4 



2350 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1965 



Table III — Asymptotic Behaviors of the Standard Deviation 

Coefficients 
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Fig. 7 — Standard deviations — first order, (a) x; second order, (b) x, (c) fl; 
third order, (d) x, (e) u, (f) s. 
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Fig. 8— Correlation coefficients — second order, (a) x, u; third order, (b) 
x, u (c) u, s, (d) x, S. 
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VI. CHOOSING THE SMOOTHING INTERVAL FOR MAXIMUM RELIABILITY 

Substituting the expressions for W and a R shown as graphs on Figs. 
6 and 7 into (1) to obtain the "worst case" safety factor produces, with 
v c , a c , and j c denoting the "worst-case" velocity, acceleration, and jerk, 



Xi = 



L 



v c T 



- -Di 



Si 



v e T\ 



or 



X 2 = 



L 



a c T 2 



- D, 



S, 






(20) 



or 



X 3 = 



jrT* 



- D 3 



\JcT 3 \ 



S 3 



where D and S, respectively, denote the dynamic and random error coef- 
ficients of the predicted value of the variable, and where the subscripts 
denote the order q of the prediction equations. To carry out the maximi- 
zation by choosing iV to maximize X, it suffices to consider only the first 
factor in each of the three expressions in (20), because only D and S 
are functions of N. Although this statement is always true, X does not 
completely determine the reliability if the dynamic error remains at its 
"worst case" value W c for more than approximately iV samples, because 
E would have two or more independent opportunities to exceed L. How- 
ever, it is believed that this additional source of unreliability can be 
disregarded when choosing ./V to maximize the reliability, because as N 
varies the reliability depends on X much more strongly than it depends 
on this additional source. 

The first factors in (20) can be written as 



X, = 



where 



, Pi - Dr 



S t ' 



or Xo = 



P 2 - D 2 



or Xs = 






Pi = 



L 



v e T 



or P 2 = 



L 



a e T* 



orP 3 = 



L 



jcT* 



(22) 



The values of N which maximize x' are shown in Fig. 9 as a function of 
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Fig. 9 — Optimum smoothing intervals versus protection parameters — (a) 
first order, (b) second order, (c) third order. 

P, and the corresponding maximum values of \' are shown in Fig. 10. 
Asymptotic formulas for these optimum values of N and X' for large 
values of P can be obtained with the aid of Tables II and III as 



tfiopt = 2/3 Pi 
tf2.pt - 0.903 VK 

N 3nnt = 1.045 ^P 3 



3/2 



X lopt ' = 0.544 (Pi) 
A 2opt ' = 0.478 (P 2 ) 5/4 
X 3opt ' = 0.432 (P 3 ) 7/6 . 



(23) 



(24) 



These formulas provide very close approximations to Figs. 9 and 10 
for JV op t > 2.5 approximately. 

An important relationship obtained by substituting (22) into (23) is 
that, for all values of A^ greater than approximately 2.5, the effective 
smoothing time A^pt T equals a constant (which depends only on L and 
the "worst case" value of the gth time derivative of the variable). 
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Fig. 10 — Optimum safety factors versus protection parameters — (a) first 
order, (b) second order, (c) third order. 

A significant conclusion may be drawn from the fact that the function 
X' in (21) is independent of the standard deviation of the measurement 
error <r ( . Consequently, a single value of N (i.e., iV opt ) simultaneously 
maximizes the reliability for all <r t . 



VII. OPTIMIZING THE ORDER OF THE PREDICTION EQUATIONS 

The order of the prediction equations can always be chosen to maxi- 
mize the reliability by using the order which gives the largest value of 
the safety factor X defined by (20) and shown in Fig. 10. A more general 
comparison between Xi , X 2 , and X 3 can be obtained for sufficiently large 
values of optimum smoothing intervals (i.e., iV op t > 2.5 approximately) 
by using the asymptotic formulas (24) to obtain the ratios, on the 
assumption that L = L, 
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Xi 1.14 1 L\ m | a c \ 1,A 



(25) 



X 2 | v c | 1 ' 2 

where v c and a c are the "worst case" velocity and acceleration, and 

x 11117" | 1/12 I ,' I'/fi 

M _ 1.11 1 ^ | \3c\ (2fi) 

where jc is the "worst case" jerk. 

If (25) is greater than unity, then the optimum first-order predic- 
tion equations [which are optimum in the sense that they use the values 
N opt given asymptotically by (23)] track more reliably than the optimum 
second-order prediction equations. Likewise, if (26) is greater than 
unity, then the optimum second-order prediction equations track more 
reliably than the optimum third-order prediction equations. The con- 
verse holds also. 

It is significant that, if the time T between measurements is not so 
large that N is too small for (25) and (26) to hold, these equations 
indicate that the optimum order q opt of the smoothing equations is 
optimum for all T (and <t ( ). 

Examples illustrating these methods are given in Appendix B. 

VIII. REDUCTION OF THE SAFETY FACTOR AND THE RELIABILITY DUE TO 
USING A NON-OPTIMUM SMOOTHING INTERVAL 

Ratios of the actual safety factor obtained by using an arbitrary value 
of N to the optimum safety factor obtained by using N opt from (23) can 
be obtained for large values of P by using the values of X op t' from (24) 
and using Tables II and III in conjunction with (21). A graph of these 
ratios is given in Fig. 11 as a function of N/N opt . This graph indicates 
that it is worse to use a value of N larger than A^ op t than it is to use a 
value proportionately smaller than JV pt • 

The reduction in reliability caused by accepting a safety factor smaller 
than optimum can be calculated from tables of the normal distribution 
function [pp. 966-72 of Ref. 8]. For example, if X = 3, the unreliability 
(i.e., unity minus the reliability) is only 0.135 percent, but the unrelia- 
bility rises to 0.35, 0.82, 1.79, or 6.68 percent if X decreases by 10, 20, 
30, or 50 percent, respectively. 

IX. CHOOSING THE TIME T BETWEEN MEASUREMENTS 

The assumptions of this paper are not sufficient to determine an 
optimum value of T, but the methods of this paper do permit describ- 
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Fig. 11 — Relative safety factor versus the ratio of actual decay constant to 
the optimum decay constant. 

ing the effects of various choices of T, if it is assumed that the optimum 
value of N is used and that JV pt is large enough (i.e., A^ opt > 2.5 ap- 
proximately) so that iVopt T equals a constant. 

One effect is that as T decreases (and N opt increases), the maximum 
initial velocity error (for which E barely keeps from exceeding L) in- 
creases because it equals L/T until the appropriate break point shown 
in Fig. 4 is reached, beyond which point the maximum tolerable initial 
velocity error remains constant as T decreases. 

Another effect is that as T increases, the standard deviations of .£ , 
0, and A [the latter two quantities being related to u and § by (10) and 
(17)] increase proportionately to \/T. In consequence, an increase of T 
causes the safety factor to decrease as l/\/T, because a R is proportional 
to s/T, and because W is independent of T (due to the assumption that 
AT opt is large). 



X. CONCLUSION 



Except for the previous section, the objective of this paper has been 
the combining of limits on the maximum tolerable prediction error L 
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and the "worst case" time-derivatives v c , a c , or j e with a time T between 
measurements, for the purpose of determining the optimal order q op t 
and smoothing interval N op t of the prediction equations. The results of 
this paper make it possible to perform parameter-variation studies in 
which q op t and iVopt are assumed to be used, and in which tradeoffs be- 
tween the following quantities are examined: (*) L, (ii) T, (Hi) the 
standard deviation a t of the measurement error, (iv) the reliability 
(defined as the probability that — L'^E^L&t the time of the "worst 
case" value v c , a c , or j c ), and (v) the "worst case" value v c , a c , or j c . 
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APPENDIX A 

Proof of Dynamic Error Theorem 
The first-order prediction (2) can be written in the operator notation 



as 



x = 



Z + ai — 1 



(27) 



where z is the advance operator for which z(x n ) = x a +i . Calculating the 
dynamic error component of the prediction error E is accomplished by 
subtracting the true value x of the variable from x and then by setting 
the observed value x of the variable equal to its true value x, because 
setting the (random) measurement error e = x — x equal to zero re- 
moves the random error component from E. Thus, the dynamic error 
component of E equals 



x — x = — x — x = 



-2_-ll 

«1 — 1 J 



■ + *" ' " L. + a.-i r (28) 

1 l (V ' V(z-D*]. 



<X\ \z + «i — 1 
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According to (4) and (5), l/«i = (N + l)/2, so that the dynamic error 
equals 

£±± C z "' , ) [-(• - l)xl (29) 

2 V + oti — 1/ 

Equation (27) shows that (29) denotes passing the negative of the first 
difference of x through a first-order prediction equation which is identical 
to the first-order prediction equation being considered. 

The second -order equations (6) and (7) can be written in operator 
notation and simultaneously solved for x to give 

„ [QiZ + a 2 (z - l)]x / 3() n 

X fa + a 2 (z - 1) + (z- l) 2 ' 

Computing the dynamic error by subtracting x from both sides and 
setting x = x as before makes (30) become 

-(z- l)*x 



x — X = 



fa + a 2 (z - 1) + (z - l) 2 
Using (8) and (9) and then (4) and (5) gives 

-(z- l) 2 x 



(3i: 



X — X = 



'—{z— l) 2 x] \ «i 



D 2 x] f 



(32) 



«i 2 [_z — 1 + Ct\_ 

Equation (32 ) represents two cascaded first-order prediction equations, 
whose smoothing interval N is the same as the smoothing interval of the 
original second-order prediction equations, operating on the negative of 
the second difference of the variable, as stated in the theorem. 

The third-order prediction equations (11) through (13) can be writ- 
ten in operator notation and solved simultaneously to give 

„ _ [ a 3 (z - l) 2 + /Mz - 1) +73ZU + 1) 1 • 

X _(z - l) 3 + a 3 (z - l) 2 + foz{z - 1) 4- y 3 z(z + 1)J ' 

so that the dynamic error equals 

_ [~(z- l) 3 x] 

X (Z - l) 3 + <X*(Z - I) 2 + P*Z{Z - 1) + 732(2 + 1) * 

Using (14) through (16) gives 



(33) 



(34) 
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i-(z- l)'x] 






(z - K)* 



-m( 



3 

cxi \ r / i \3 



z — 1 + ai 



(35) 



[-(*- D'4 



where the smoothing interval A/ of the three cascaded first-order pre- 
diction equations is the same smoothing interval as that which appears 
in the original third-order prediction equations, as stated in the theorem.* 

The abridgment of this theorem obtained by omitting the q cascaded 
first-order prediction equations {i.e., the dynamic error is approximated 
by multiplying the unsmoothed qth difference by — [(N +■ l)/2] 9 } has 
approximated the dynamic error very accurately in simulations of the 
slowdown of ballistic devices re-entering the earth's atmosphere. 

As an application of the theorem, the maximum dynamic error com- 
ponent for the second-order prediction equations (6) and (7) for an 
initial error u ' in u can be approximated by performing the steps listed 
in the theorem. Thus, the second difference of x = u 'n for n ^ and 
x = for x < is an impulse of height u ' . This impulse enters two 
cascaded first-order prediction equations, whose combined impulse 
response {obtained by convolving two exponentials together and multi- 
plying by [(N + l)/2] 2 } is approximately equal to u 'n exp ( — 2n/N). 
The maximum value of this impulse response occurs at n = N/2, so that 
the maximum value of the dynamic error component of the prediction 
error E equals approximately m (N/2e). 

Similarly, it is possible to calculate the dynamic error due to an initial 
error s ' in § in the third-order equations (11) through (13) since the 
third difference of x = 8 V for n ^ and x = for n < is two suc- 
cessive impulses of height equal s . These two impulses can be regarded 
as having the effect of a single impulse of height 2sd if the smoothing 
interval N is sufficiently large compared to two. The impulse response 
of three cascaded first-order prediction equations is 

(n 2 /2)exp[-2(n- l)/(N + 1)1 



* For nth order exponential smoothing (as opposed to prediction) equations in 
which p is any integer, it is possible to prove that the dynamic error defined as 
x — x is equal to [(AT — \)/2\ p times the negative of the nth difference (defined as 
[1 — (l/z)l''| and passed through p first-order exponential smoothing equations of 
the same smoothing interval. (This proof starts with (7) of Ref. 10 and uses the 
lemma 

( 2 - DBi = -^— le - Six, 
1 — «1 

where S is an operator denoting a first -order exponential smoothing equation.) 
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after multiplying by the [(N + l)/2] 3 factor. Thus, the dynamic error 
component of the prediction error E can be approximated by SoV exp 
[— 2(n — 1)/(N + 1)]. The maximum of this approximation occurs at 
n = N + 1, so that the maximum dynamic error equals approximately 
So (N + l) 2 exp [— 2N/(N +1)]. (This approximation is never more 
than 12 percent larger than the true value of the dynamic error for any 
N ^ 7. The expression s ' (N + l) 2 exp (—2) provides an approxima- 
tion which is always smaller than the true value of the dynamic error. ) 
To conclude these examples of the use of the theorem, the dynamic 
error components of the prediction error E for the third-order prediction 
equations (11) through (13) caused by an initial error u Q ' or Xo in H or 
£ can be approximated respectively by the first or second derivative 
with respect to n of \n exp ( — 2n/N) multiplied by u ' or .t . 

APPENDIX B 

Examples 

It is necessary to illustrate how the assumptions and results discussed 
in the text apply to an actual instrument. Thus, echo-ranging radars 
have measurement errors which occur independently of the measurement 
error on any other pulse. The reason for this independence is that the 
interval T between measurements is always much larger than the re- 
ciprocal of the bandwidth of the instrument, because the time for the 
echo to return from the target always greatly exceeds the duration of 
the echo. 

In monopulse radars measuring the range and angles of a target, ther- 
mal noise originating in the receiver often results in a t being equal to the 
product of the following two quantities: (i) a large fraction of the pulse- 
width or beamwidth, and (ii) the reciprocal of the square root of the 
ratio of the peak signal power to the average noise power [Chapter 10 of 
Ref. 6]. If quantization errors are present because the instrument meas- 
ures the prediction error digitally, <r, can usually be calculated by taking 
the square root of the sum of squares of the standard deviations of the 
thermal and quantization errors. 

At the end of Section VI of the text, it is stated that a single value of 
the smoothing interval N simultaneously maximizes the reliability for 
all <r t . In radar terminology, this value of N (i.e., AT opt ) is optimum for 
any ratio of signal power to noise power. Likewise, Section VII concludes 
with a result which can be interpreted as stating that the optimum value 
g-opt of the order of the prediction equations is optimum for any data 
rate and any ratio of signal power to noise power. 
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The use of some of the equations and graphs presented previously is 
exemplified by the following: A radar emitting a 100 foot-wide pulse 
every 0.27 seconds is tracking the range of a descending spaceship whose 
maximum velocity is —35,000 ft/sec, whose maximum acceleration is 
320 ft/ (sec) 2 , and whose maximum jerk is simulated to be 8.3 ft/ (sec). 3 
(These maxima occur at different times during the re-entry of the 
spaceship into the atmosphere.) It is discovered that the radar can meas- 
ure the prediction error sufficiently accurately only if the prediction 
error | E | is smaller than half the pulsewidth, so that L = 50 feet.* The 
standard deviation a f of the measurement error e is found to be ap- 
proximately equal to L/\/S/N, where S/N denotes the ratio of the 
instantaneously maximum signal power to the average noise power in 
the output of the radar's IF strip. 

Equation (25) reveals that second-order smoothing is more reliable 
than first-order smoothing if L < 2.26 X 10 6 feet, and (26) reveals that 
third-order smoothing is more reliable than second if L < 4.36 X 10 5 
feet. In consequence of assuming L to be only 50 feet, third-order smooth- 
ing should be used. 

Using (22) to calculate P 3 gives P 3 = 306. Fig. 9 indicates that A^ 0P t = 
7, and Fig. 10 indicates that X' = 275. Computing X by multiplying x' 
by I jcT 3 1 A. gives X = 0.9 VsjN. If S/N = 16 = 12 db, X = 3.6 and 
the "worst-case" reliability is 99.98 percent, according to pp. 966-972 
of Ref. 8. Similarly, if S/N = 4 = 6 db, X = 1.8, and the "worst-case" 
reliability is 96.4 percent. The values of a 3 , ft , and y 3 corresponding to 
N opt = 7 can be calculated with the aid of (14) through (16) or looked 
up on Fig. 3 as a 3 = 0.578, ft = 0.164, and 73 = 0.0078, or, in exact 
octal fractions, a, = (0.45) 8 ,& = (0.124) 8 , 7 3 = (0.004) 8 . 

The "worst case" dynamic error component W c can be calculated with 

* Boom use of this assumption that L' = L, it is also assumed that the (tynamic 
error | W r | is large enough and an is small enough so that 

or or 

thereby making the reliability be significantly affected only by changes in the 
value of X. This assertion can be verified by calculating 

L + I W t I , L - I W c I 
and 



with the aid of the values of W c and a R given in the last paragraph of this appen- 
dix and using the tables of the normal distribution function (pp. 966-972 of Ref. 
8) to compare the effects of changes in 

I + I W e I , L - \W e \ 
and . 
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the aid of Fig. 6 as 64 times j c T , or 10.5 feet. The standard deviation o- H 
of the random component can be calculated with the aid of Fig. 7 as 
0.88 <r t = U'/ VS/N. If S/N = 4 = 6 db, <r s = 22 feet and if S/N = 
16 = 12 db, <j R — 11 feet. As a check on the consistency of the results, 
(1) gives values of X which are identical to those calculated in the 
previous paragraph. 
•ii 
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